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Clostridium drakei SLl^ is a strictly anaerobic, H2-utilizing, and acid-tolerant acetogen isolated from an acidic sediment that is a 
potential platform for commodity chemical production from syngas fermentation. The draft genome sequence of this strain will 
enable determination of the acid resistance and autotrophic pathway of the acetogen. 
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Acetogenic bacteria produce over 10'^ kg of acetic acid annu- 
ally, which comprises 10% of the global acetate production 
(1). In this process, acetogens autotrophicaUy reduce COj to ace- 
tate with as the sole electron donor through the reductive acetyl 
coenzyme A (acetyl-CoA) pathway, also known as the Wood- 
Ljungdahl pathway (2). In addition, CO, one of the most abun- 
dant syngases, can be utilized for autotrophic growth as the sole 
electron donor. Because of these metabolic capabilities, acetogens 
are an attractive platform for the production of useful multi- 
carbon compounds by syngas fermentation. Among acetogenic 
clostridial species, Clostridium drakei SLl^ is a unique acetogen 
isolated from the sediment of an acidic coal mine pond (3). More- 
over, to date, most of the acetogens have been isolated from a 
habitat with neutral pH, which suggests that an acidic environ- 
ment is unfavorable for acetogens (4). Further, C. drakei utilizes a 
wide range of substrates such as glucose, fructose, arabinose, and 
xylose to produce acetate as the main reduced end product (3). To 
understand the unique physiological properties, including acid 
resistance, and metabolic capabilities of syngas utilization of 
C. drakei, we obtained its genome sequence information. 

The C. drakei genome was sequenced using Illumina MiSeq 
with a 2X150-cycle paired-end platform (Illumina, San Diego, 
CA). The sequencing library was constructed using a TruSeq DNA 
sample prep kit (Illumina). Before sequence assembly, phiX se- 
quences were removed and the reads were trimmed by their qual- 
ities (low-quality sequence, 0.05 limit, and ambiguous nucleo- 
tides, maximum of 2 nucleotides allowed) using CLC Genomics 
Workbench (CLC Bio, Aarhus, Denmark). We obtained a total of 
4,441,384,356 bases in 30,418,087 reads, which were then assem- 
bled using CLC Genomics Workbench (minimum contig length, 
302; automatic bubble size, yes; word size, 61; perform scaffold- 
ing, yes) . The assembly resulted in 140 contigs, the largest of which 
is 1,132,561 bases. The draft genome sequence was annotated us- 
ing the RAST server (5). 

The resulting draft genome sequence of C. drakei comprises 
5,635,531 bases, with a 35% G-hC content and 5,763 predicted 
protein-coding sequences (CDSs). Eleven rRNAs and 100 tRNAs 



were predicted using the RNAmmer 1.2 server (6) and 
tRNAscan-SE 1.23 (7), respectively. The genome of C. drakei is 
larger than those of other acetogenic clostridial species such as 
C. ljungdahlii (4.6 Mbp) (8) and Moorella thermoacetica (2.6 Mbp) 
(2) but smaller than those of C. ultunense Esp (6.2 Mbp) (9) 
and C. methoxybenzovorans (7.0 Mbp) (accession number 
ATXDOOOOOOOO). C. drakei possesses the metabolic pathway re- 
quired to produce butanol from acetyl-CoA, which is mediated by 
the NADH-dependent butanol dehydrogenase. C. acetobutylicum 
and C. carboxidivorans (5.6 Mbp), which is closely related to 
C. drakei, also possess this enzyme ( 10). An in-depth study of the 
metabolic pathway would enable application of C. drakei as a po- 
tential industrial platform for utilization of cellulose-derived sug- 
ars such as xylose, arabinose, and rhamnose (11). 

Nucleotide sequence accession numbers. The draft genome 
sequence of C. drakei has been deposited at the DDBJ/EMBL/ 
GenBank database under the accession no. JIBUOOOOOOOO. The 
version described in this paper is the first version, JIBUOIOOOOOO. 
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